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FLUID MODEL FOR A NETWORK OPERATING UNDER 
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University of Cambridge and University of California 

We consider a model of Internet congestion control that repre- 
sents the randomly varying number of flows present in a network 
where bandwidth is shared fairly between document transfers. We 
study critical fluid models obtained as formal limits under law of 
large numbers scalings when the average load on at least one re- 
source is equal to its capacity. We establish convergence to equilibria 
for fluid models and identify the invariant manifold. The form of the 
invariant manifold gives insight into the phenomenon of entrainment 
whereby congestion at some resources may prevent other resources 
from working at their full capacity. 

1. Introduction. Roberts and Massoulie [19] have introduced and stud- 
ied a flow-level model of Internet congestion control, that represents the 
randomly varying number of flows present in a network where bandwidth is 
dynamically shared between flows that correspond to continuous transfers 
of individual documents. This model assumes a "separation of time scales" 
such that the time scale of the flow dynamics (i.e., of document arrivals and 
departures) is much longer than the time scale of the packet level dynamics 
on which rate control schemes such as TCP converge to equilibrium. 

Subsequent to the work of Roberts and Massoulie, assuming exponentially 
distributed document sizes, de Veciana, Lee and Konstantopoulos [9] and 
Bonald and Massoulie [2] studied the stability of the flow-level model oper- 
ating under various bandwidth sharing policies, where a bandwidth sharing 
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policy corresponds to a generalization of the notion of a processor sharing 
discipline from a single resource to a network with several shared resources. 
Lyapunov functions constructed in [9] for weighted max-min fair and propor- 
tionally fair policies, and in [2] for weighted a-fair policies [a € (0,oo)] [17], 
imply positive recurrence of the Markov chain associated with the model 
when the average load on each resource is less than its capacity. 

As a mechanism for performance analysis, we propose to use critical fluid 
models and related Brownian models to explore the behavior of flow-level 
models operating under weighted a-fair bandwidth sharing policies in heavy 
traffic. We are particularly interested in manifestations of the phenomenon 
of entrainment, whereby congestion at some resources may prevent other 
resources from working at their full capacity. As a first step in this explo- 
ration, in this paper we consider critical fluid models, obtained as formal 
limits under law of large numbers scaling, from the flow-level models with 
exponentially distributed document sizes and operating under weighted a- 
fair bandwidth sharing policies. The term critical refers to the fact that the 
nominal (or average) load on at least one resource is equal to its capacity, and 
for the other resources their nominal loads do not exceed their capacities, see 
(11) and (12). We identify the invariant states for the critical fluid models 
and we study the convergence to equilibria of critical fluid model solutions as 
time goes to infinity. Extrapolating from results for open multiclass queue- 
ing networks, we conjecture that such behavior is key to establishing heavy 
traffic diffusion approximations (also called Brownian models) for these flow- 
level models. We indicate the natural diffusion approximations suggested by 
our fluid model results. 

There are several motivations for our work. One source of motivation lies 
in fixed point approximations of network performance for TCP networks (cf. 
[[7], [12], [20]]). These approximations require, as input, information on the 
joint distribution of the numbers of flows present on different routes, where 
dependencies between these numbers may be induced by the bandwidth 
sharing mechanism. Similarly, an understanding of such joint distributions 
seems important if the performance models for a single bottleneck described 
by Ben Fredj, Bonald, Proutiere, Regnie and Roberts [1] are to be general- 
ized to a network. Another motivation is that the flow-level model typically 
involves the simultaneous use of several resources. With exponential doc- 
ument sizes, this model can be equated (in distribution) with a stochastic 
processing network (SPN) as introduced by Harrison [13, 14]. Open mul- 
ticlass queueing networks are a special case of SPNs without simultaneous 
resource possession. For such networks operating under a head-of-the-line 
service discipline, it has been shown [5, 21] that suitable asymptotic behav- 
ior of critical fluid models implies a property called state space collapse, 
which validates the use of Brownian model approximations for these net- 
works in heavy traffic. For more general SPNs, investigation of the behavior 
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of critical fluid models, of a related notion of state space collapse, and of 
the implications for diffusion approximations, are in the early stages of de- 
velopment. The analysis in this paper can be viewed as a contribution to 
such an investigation for models involving simultaneous resource possession. 
Finally, although we restrict to exponential document sizes in this paper, we 
would like to relax that assumption in future work. Although this involves 
a significantly more elaborate stochastic model to keep track of residual 
document sizes (because of the processor sharing nature of the bandwidth 
sharing policy), knowing the results for exponential document sizes is likely 
to be useful for such work. 

In order to state our results for the fluid model and conjectures for dif- 
fusion approximations, we need first to define the network structure, the 
weighted a-fair bandwidth sharing policy and the stochastic model. This 
is done in Sections 2-4. The notion of a fluid model solution is defined in 
Section 5 and we state our main results there. The proofs of these results 
are given in Section 6. Appendix A develops some properties of the function 
that defines bandwidth allocations and Appendix B shows that our defini- 
tion of a fluid model solution is reasonable in that fluid model solutions can 
be obtained as limit points of the stochastic model under fluid (or law of 
large numbers) scaling. 

Notation. For each positive integer d> 1, M. d will denote d-dimensional 
Euclidean space and the positive orthant in this space will be denoted by 
= { x £ R d : Xi > for i = 1, . . . ,d}. The Euclidean norm of x £ R d will 
be denoted by ||sc||. Inequalities between vectors in W d will be interpreted 
componentwise, that is, for x,y E M. d , x < y is equivalent to Xi < yi for i = 
1, . . . , d. Given a vector x € the dx d diagonal matrix with the entries of x 
on its diagonal will be denoted by diag(x). For positive integers d\ and d2, 
the norm of a d\ x tfe matrix A will be given by 



The set of nonnegative integers will be denoted by Z + and the set of points 
in with all integer coordinates will be denoted by Z^. A sum over an 
empty set of indices will be taken to have a value of zero. The cardinality of 
a finite set S will be denoted by \S\. 

2. Network structure. We consider a network with finitely many re- 
sources labelled by j G J . A route i is a nonempty subset of J (interpreted 
as the set of resources used by route i). We are given a set X of allowed 
routes. We assume that J and T are both nonempty and finite. Let J = \J~\, 
the total number of resources, and I = \I\ , the total number of routes. Let A 
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be the J x I matrix containing only zeros and ones, denned such that Aji = 1 
if resource j is used by route i and Aji = otherwise. Our assumption that 
each route i identifies a nonempty subset of J implies that no column of A is 
identically zero. We assume that A has rank J, so that it has full row rank. 
We further assume that capacities (Cj :j £ J) are given and that these are 
all strictly positive and finite. 

3. Bandwidth sharing policy. Bandwidth is allocated dynamically to the 
routes according to the following bandwidth sharing policy, which was first 
introduced by Mo and Walrand [17]. (To see how this fits into a stochastic 
model for the network dynamics, see Section 4.) 

Given a fixed parameter a £ (0, oo) and strictly positive weights (/tj : i £ 
X), if Ni(t) denotes the (random) number of flows on route i at time t for 
each i £ X and N(t) = (Ni(t) :i£l), then the bandwidth allocated to route i 
at time t is given by Aj(iV(t)) and this bandwidth is shared equally amongst 
all of the flows on route i. The function A(-) = (Aj(-) :i € X) is defined as 
follows (we define it on all of R+ as we shall later apply it to fluid analogues 
of N). Let A : R\ -> R\ be defined such that for each n £ R+, Aj(n) = for 
i £ Io(n) = {I £ X: n\ = 0}, and when X + (n) = {/ £ X: n\ > 0} is nonempty, 
A + (n) = (Aj(n) : % G X + (n)) is the unique value of A + = (A, : i £ X + (n)) that 
solves the optimization problem 

(1) maximize G n (A + ) 

(2) subject to ^2 AjiAi<Cj, j£j, 

iSX+(n) 

(3) over Aj > 0, i £ X + (n), 
where for n £ \ {0} and A+ = (Aj : i £ X+(n)) £ m)+ +{n)l , 



(4) G n (A + ) 



Kin?^-A if«G(0,oo)\{l}, 

(n) 

KjnjlogAj, ifa = l, 



and the value of the right member above is taken to be — oo if a £ [l,oo) 
and Aj = for some i £ T+ (n). The resulting allocation is called a weighted 
a- fair allocation. 

Various properties of the mapping A : — ► are developed in Ap- 
pendix A of this paper. In particular, for each n £ M^: 

(i) Aj(n) > for each i £ X + (n), 

(ii) A(rn) = A(n) for each r > 0, 

(hi) Aj(-) is continuous at n for each i £ X + (n), and 
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(iv) there is p G (not necessarily unique and depending on n) such 
that 

( Ki \ 

(5) Aj(ra) = m — — - — for each i G Z+(n), 

V^. ; , jPj A jiJ 

where 

(6) Vj (Cj - AjiAi (n)j = for all j G J. 

The (pj :j G J7") are Lagrange multipliers for the optimization problem (1)- 
(3), one for each of the capacity constraints (2). [Note that for each i G 
Z+(n), since Aj(n) > [by (i)] and rij > (by definition), the fact that the 
representation (5) holds implies that p is such that the denominator in the 
right member of (5) does not vanish.] 

When Ki = l,i Gl, the cases a — > 0, a — ► 1 and a — > oo correspond re- 
spectively to an allocation which achieves maximum throughput, is propor- 
tionally fair or is max-min fair [2, 17]. Weighted a-fair allocations provide 
a tractable theoretical abstraction of decentralized packet-based congestion 
control algorithms such as TCP, the transmission control protocol of the In- 
ternet. Indeed, if a = 2 and Ki is the reciprocal of the square of the round trip 
time on route i, then the formula (5) is a version of the inverse square root 
law familiar from studies of the throughput of TCP connections [11, 16, 18]. 
The relations (2) and (3), (5) and (6) and more refined versions of these 
relations, can be solved by iterative methods to give predictions of through- 
put, given the numbers of flows N(t) present at time t [7, 12, 20]. Given 
a distribution for N(t), the overall network performance can be predicted. 
But a major difficulty with this approach is the choice of the distribution 
for N(t). For example, if flows arrive on different routes as independent Pois- 
son processes and if flows on a route remain in the system for independent 
and identically distributed holding periods, then the stationary distribution 
of the process N is easy to describe: the components are independent, each 
with a Poisson distribution, whatever the distribution of holding periods. 
This model is indeed used in [12] and might be appropriate for real-time 
flows whose time in the system is unaffected by their allocated bandwidth. 
But for many flows, for example, document transfers, their length of time in 
the system is affected by their allocated bandwidth, and this may produce 
correlations between the components of N(t) which need to be understood. 
Roberts and Massoulie [19] have begun the study of a stochastic model that 
captures this effect. 
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4. Stochastic model. An active flow on route i corresponds to the con- 
tinuous transmission of a document through the resources used by route i. 
Transmission is assumed to occur simultaneously through all resources on 
route i. The number of active flows on route i at time t is denoted by 
Ni(t). The stochastic process N = {(N±(t), . . . ,Ni(t)), t > 0} is assumed to 
be a Markov process with state space and infinitesimal transition rates 
q:Z\ xZ^l given by 

(7) q(n,m) = Vi if m = n + ej, 

(8) q(n,m) = fiiAi(n) if to = n — ej, rii > 1, 

(9) q(n, m) = otherwise, 

for each n,m G Hi, i Gl, where, for each i, f« > and /ij > are fixed 
constants, and e, is the I-dimensional unit vector whose ith component is 1 
and whose other components are all zero. 

This corresponds to a model where new flows arrive on route i according 
to a Poisson process of rate Vf, for i such that Ni(t) / 0, Ai(N(t))/Ni(t) 
is the bandwidth allocated to each active flow on route i at time t; and a 
flow on route i transfers a document whose size is exponentially distributed 
with parameter //j. This is the model of Roberts and Massoulie [19] with 
exponential document sizes. 

From the results of de Veciana, Lee and Konstantopoulos [9] and Bonald 
and Massoulie [2], we know that the Markov chain N is positive recurrent if 

(10) ^Aji/HKCj, 

where pi = Vi/ in for all i G X. These are natural constraints: pi is the average 
load produced by route i, and we can identify the ratio of the two sides of 
the inequality (10) as the traffic intensity at resource j. Indeed, condition 
(10) is necessary for positive recurrence of N. For a proof, suppose that 
N is positive recurrent and fix j G J . The virtual waiting time Vj(t) for 
resource j at time t is the amount of time, measured from time t onwards, 
that it would take to complete the transfer of all of the documents that 
are being transmitted through resource j at time t, assuming that external 
arrivals are turned off after time t, that is, no new documents are accepted 
for transmission after time t, and that all other resources are given infinite 
capacity, that is, Ck = +oo for all j, after time t. The virtual waiting 
time thus measures the time it would take for resource j to become idle if 
there were no more arrivals after time t and if resource j could work at full 
capacity from time t. Suppose that the network starts empty. The positive 
recurrence of N implies that the mean time for the virtual waiting time 
process Vj to return to zero (after first moving away from zero) is finite. 
Consider another network with the same features as the original one, except 
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that Ct = +00 for all k^j. Let Vj denote the virtual waiting time process 
for resource j in this network. When the same arrival and document size 
processes are used for the two networks, Vj(t) < Vj(t) for all t. In particular, 
the mean time for Vj to return to zero must be finite. Now, Vj is equivalent in 
distribution to the virtual waiting time process for a multiclass single server 
queueing system operating under a work conserving service discipline. This 
system has one queue for each i such that Aji = 1. The queue associated 
with such an i has an infinite capacity buffer, Poisson arrivals at rate z/j, 
i.i.d. exponential service times with a mean of and the server serves at 
a maximum rate of Cj. The virtual waiting time process for this queueing 
system is the same for all work conserving service disciplines and it is well 
known that the mean time for this process to return to zero is finite if and 
only if J2iei ^jiPi < Cj. Since j was arbitrary, it follows that (10) must hold. 

It is an open question whether, in the generalization of the above model 
to allow arbitrarily (rather than exponentially) distributed document sizes, 
the condition (10) is sufficient for stability. 

5. Main results. Our aim in this paper is to begin to explore the behavior 
of the Markov chain {N(t),t > 0} when 



and some of the constraints are saturated, that is, some of the resources are 
in heavy traffic. Thus, we henceforth assume that (11) holds and that 



Let J* = |J7*| and without loss of generality assume that the first |J*| ele- 
ments of J correspond to the set J*. 

Here, we focus on understanding the behavior of fluid model solutions, 
which can be thought of as formal limits of the stochastic process N under 
law of large numbers scaling. The following notions are used in the definition 
below. A function / = • • • , /i) : [0, 00) — > is absolutely continuous if 
each of its components fi : [0, 00) — > R+, % = 1, ... ,1, is absolutely continuous. 
A regular point for an absolutely continuous function /: [0, 00) — > is a 
value of t G [0, 00) at which each component of / is differentiable. [Since / is 
absolutely continuous, almost every time t 6 [0, 00) is a regular point for /.] 

Definition 5.1. A fluid model solution is an absolutely continuous 
function re: [0, 00) — ► such that at each regular point t for re(-), we have 
for each i G X, 



(11) 



(12) 




(13) 




if rii(t) > 0, 
if m{t) = 0, 
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and for each j G J , 

(14) Y, AjMn(t))+ Y A 3iPi< c i> 

ieX+(n(t)) ieX (ra(i)) 

where 2+(n(i)) = {» G X : n*(t) > 0} and Zo(n(i)) = {i£l: n<(t) = 0}. 

Motivation for this definition is given in Appendix B through a fluid limit 
result. For the moment, we observe that if rii{t) > 0, then the right-hand 
side of (13) is the infinitesimal drift of N^t) when N^t) > [cf. (7) and (8)]. 
On the other hand, if rij(t) = and t is a regular point for n(-), then the 
derivative of rii(-) at t is forced to be zero since nj(s) > for all s > — 
to see this, consider the left- and right-hand derivatives of n-j(-) at t. This 
property may seem counterintuitive, however, this phenomenon is common 
in fluid models for queueing systems. It reflects the fact that a fluid model 
solution is obtained as a (formal) law of large numbers limit from the original 
stochastic model, and consequently a fluid model solution state n G for 
which m = can be the limit of rescaled states in the stochastic model 
where the ith component is at or near zero. The inequality (14) is derived 
from the fact that, in the stochastic model, the cumulative unused capacity 
for each resource is a nondecreasing process. As in the derivation of the 
differential equation (13), some care is needed here in treating routes i, for 
which rii(t) = 0. One might paraphrase (14) as saying that the total fluid 
model bandwidth allocation for each resource cannot exceed its capacity, 
where the allocation to any route i satisfying rii(t) = is pi at time t. For 
a more detailed justification, we refer the reader to Theorem B.l and its 
proof. 

Following Bramson [5] , we now define an invariant manifold for fluid model 
solutions. 

Definition 5.2. A state no G R\ is called invariant if there is a fluid 
model solution n(-) such that n(t) = uq for all t > 0. Let A4 a denote the set 
of all invariant states. We call M a the invariant manifold. 

The following is a simple characterization of ./VI Q . 

Lemma 5.1. The set of invariant states, M a , is given by 

(15) {n G R I + :A i (n)=p i for all i€l + (n)}. 

Proof. Let M a denote the set in (15). Note that M a and M a are 
nonempty since they both contain the origin in M^. 
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To show that M a cA/" a , suppose that no G M a - If n o = 0, then it follows 
trivially that no G N a . If no 7^ 0, then there is a fluid model solution n(-) 
satisfying n(t) = hq for all t > and so it follows from (13) that 

(16) Aj (n ) = ft for all JGl + (n ). 

Conversely, to show that j\f a C Ai a , suppose that no G M a . Then, by (11), 
n(t) = no for alH > satisfies (13) and (14) for all t and all i G X, j G ^7, and 
so n(-) is a valid fluid model solution. Hence no is in .M Q . □ 

The following alternative characterization of the invariant states will also 
be used. It is proved in Section 6. 

Theorem 5.1. A state n G Ri is an invariant state if and only if there 
is q G R+* such that 

(17) ru = Pi ( ^ jeJ - K 9jAji y a for alii EX. 

Remark 5.1. In fact, as examination of the proof of the above theo- 
rem reveals, for an invariant state n, there is a one-to-one correspondence 
between the vectors q appearing in the above representation of n and the 
Lagrange multipliers p appearing in the characterization of A + (n) given in 
Lemma A. 4 [see also (5) and (6)]. This correspondence is obtained by taking 
the entries of q to be given by the entries of p with indices j G J* and noting 
that the other entries in p are necessarily zero. 

For each n G Ri., we define the distance of n from A4 a as 

(18) d(M a ,n) = M{\\v-n\\:veM a }- 

The following theorem shows that, starting in any compact set, fluid model 
solutions converge uniformly towards the invariant manifold. This theorem 
is proved in Section 6. 

Theorem 5.2. Fix R G (0,oo) and e > 0. There is a constant T^ e < 00 
such that for each fluid model solution n(-) satisfying ||n(0)|| < R we have 

(19) d(M a ,n(t)) <e for all t > T Rj£ . 

In the course of proving Theorem 5.2, in Section 6, we prove the following 
(see Theorem 5.3) alternative characterization of invariant states. For this, 
define w(n) = (wj(n) :j G J*) for n G R?j_ to be given by 

(20) w j (n) = Y / A j i-, 
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We call w(n) the workload associated with n. Let 

(21) F{n) = -^J2 v iWi~ 1 (ffi + for all n G M} + . 

This function F was used in [2] as a Lyapunov function to show positive 
recurrence of N under the conditions (10). An intuitive interpretation of the 
function F is as follows. If the the number of flows on each route is fixed and 
given by the components of n G ML, then by Little's law nj/z^ is the mean 
time that a flow on route i spends in the system and the time a flow on 
route i spends in the system is exponentially distributed with mean rij/i^. 
The (a + l)st moment of this random variable is T(a + 2)(nj/i^) a+1 , where 
r(-) is the usual Gamma function. Thus, given n, F(n) can be interpreted 
as a weighted sum over the routes, where for route i, the summand is the 
weight fjKj^" _1 /(a + 1)T(q + 2) times the (a + l)st moment of the amount 
of time spent in the system by a flow on that route. 

For w G M+*, define A(w) to be the unique value of n G R5_ that solves 
the following optimization problem: 

minimize F(n) 

(22) subject to ^^4ji— > Wj, j G J*, 

i<EZ ^ 

over 7ij > 0, i El. 

Remark 5.2. Since A has full row rank and its only entries are zeros 
and ones, for each w G Ri*, the feasible set of the optimization problem 
(22) is nonempty, and then since F is nonnegative on and F{n) — ► oo as 
||n|| — > oo, (22) has an optimal solution. By the strict convexity of F, this 
solution is unique. 

Theorem 5.3. A vector n G is an invariant state if and only if 
n = A(w(n)). 

The map A : R + * — > plays an analogous role for the flow-level model 
of [19] to the lifting maps occuring in Bramson's work [3, 4] on the asymp- 
totic behavior of fluid models associated with multiclass queueing networks 
operating under certain head-of-the-line service disciplines. It is natural to 
conjecture that one might prove a state space collapse theorem for the flow- 
level model in an analogous manner to that in [5], and extend the diffusion 
approximation results developed for multiclass queueing networks in [21], to 
prove a diffusion approximation for the flow- level model. This suggests that, 
under suitable rescaling and initial conditions, a diffusion approximation for 
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the J* -dimensional workload process W = {W(t) : t > 0} denned by 

(23) W0(*)=£47i — ' J^*. 

is likely to be a reflecting Brownian motion W living in the workload cone 

(24) W a = A*M- 1 M a , 

where M = diag(ju), A* is the J* x I matrix obtained from A by eliminating 
those rows of A that are not indexed by elements of J* , and 



(25) , 

i G I, for some q G > . 

Here the direction of reflection on the boundary surface corresponding to 
qj = is the unit vector pointing in the direction of the positive jth coordi- 
nate axis. Furthermore, state space collapse should yield an approximation 
TV for N, under diffusion scaling, where N = A(W). This conjecture will be 
pursued in a subsequent work. 

To illustrate the conjecture, we consider the following simple example. 
Suppose that J = {1, 2} and X = {{1}, {2}, {1, 2}}, corresponding to a linear 
network with two resources and three routes. Let a G (0, oo), K% = \i{ = 1, 
for i = 1,2, 3, Cj = 1 for j = 1, 2, and p\ + ps = pi + ps = 1. Then the state 
space for the diffusion W is the cone 

W a = {(w 1 ,w 2 ):w 1 = piq\ /a + P3{qi + Q2) 1/a , 

w 2 = P2ql /a + Ps{qi + q2) 1/a , for some q 1 > 0, q 2 > 0}, 
which, for all a G (0, oo), is the same as the cone 

{(w 1 ,w 2 ):wi>0, w 1 p 3 <w 2 <wip 3 1 } 

pictured in Figure 1. Reflection occurs in the horizontal direction (corre- 
sponding to resource 1 incurring idleness) on the bounding face w\ = w 2 p-$. 
The interpretation of this is that although there is work for resource 1 within 
the system, congestion at resource 2 is preventing resource 1 from working 
at its full capacity. Similarly, vertical reflection (corresponding to resource 
2 incurring idleness) on the bounding face w 2 = w\pz is interpreted to mean 
that congestion at resource 1 is preventing resource 2 from working at its 
full capacity. Although the workload cone is the same for all a G (0, oo) in 
this example, this will not be the case, in general, for higher-dimensional 
workloads. 
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6. Proofs: characterization of invariant states and convergence to the 
invariant manifold. 



Proof of Theorem 5.1. It follows from Lemma 5.1 and the charac- 
terization of A + (n) in terms of Lagrange multipliers given in Lemma A. 4, 
that n G A4 a if and only if there is p G such that 

(26) m = Pi l^ 1 ' J i '' jAj ') for all i G Z+(n) 



and for all j G J , 



(27) Pj\ C j- L A JiPi)=°- 

iSX+(n) 

Note that for j G J~\ J7*, J2iei AjiPi < Cj and so (27) holds for such a j if 
and only if pj = 0. It follows that we can replace J by and J by J* in 




Fig. 1. T/ie workload cone W a for a network with two resources, with workloads labelled 
wi,W2, and three routes, with traffic loads labelled pi,p2,ps- Under the lifting map A, 
points (w\,W2) on the boundary w± — W2P3 are mapped to points (711,712,713), where ?ii = 
(and the corresponding q £ R+ has qi =0); similarly, points (wi,W2) on the boundary 
W2 — ttji/93 are mapped to points (711,712,773), where 712 = (and the corresponding q £ R+ 
has q2 = 0). 
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the above characterization of invariant states. The characterization given in 
the theorem can now be deduced as follows. 

First, consider an invariant state n and i G T\2+(n). Then n, = and for 
any j G J* such that Aji > 0, we have 

(28) ]T AjipiK^AjiP^Cj, 

ieT+(n) lei 

and so by (27), we must have pj = 0. On combining the above, we see that 

(29) ni = J E ^ PjA A 1/a foraUi€Z. 



Thus, any invariant state has the form given in (17) with qj = Pj for j G J*. 

Conversely, suppose that n is of the form given in (17) for some q G . 
Set pj = for j G J\ J7* and pj = qj for j & J*. Then, (26) holds immediately 
with in place of J , and so it suffices to show that p and n satisfy the 
complementarity condition (27) for each j € J*. The only way that this can 
fail to hold is if there is j G and i G I such that qj > 0, Aji > and 
Hi = 0. But, by the representation (17), n\ = implies that qj = for all 
j G J* satisfying Aji > 0. Thus, (27) must hold for all j £ J*. □ 

We need some preliminary lemmas before we can prove the other char- 
acterization of invariant states given by Theorem 5.3. For this, recall the 
definition of the function F from (21) in Section 5. For any fluid model so- 
lution n(-), F(n(-)) is absolutely continuous and at each regular point t for 

»(•)■ 

(30) iH.(«))=Es?« 



(31) = £ «*))>,- W*))) 

(32) = E J<^)7ii-A,(„( ( >) 

(33) =tf(n(t)), 
where, for each n G K^_, 

(34) tf(n) = £ ^(^)7^-A,(n 

i G X+(n) V ^ 7 VW 

Indeed, K(0) = and for nel^\{0}, 

(35) K{n) = VG n (A+'») ■ (A + >*(n) - A+(n)), 



-i 
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where A + (n) = (A;(n) : i G 2+(n)), A+>*(n) = : i G J+(n)), VG„(-) is the 
gradient of the function G n {-) defined in (4). 

Lemma 6.1. The function K is continuous on and 

(36) K(n)<0 for each neR* + , 
where the inequality is strict unless n is an invariant state. 

Proof. The continuity of K follows from the definition (34), combined 
with Lemma A. 3 and the fact that the term indexed by % in the sum in (34) 
is small if ni is near zero, since A(n) is bounded. 

If n = 0, K(n) = and is an invariant state. Now suppose that n^O. 
Then, by Lemma A.l, A + (n) solves the optimization problem (l)-(3) on 
(0,oo)l x +WI. Since A + = A + '*(n) is feasible for this problem and G n (A + ) is 
concave as a function of A + G (0, oo)' I +( n '' with a strictly negative definite 
(diagonal) Hessian matrix of second partial derivatives at each point, it 
follows from (35) that K(n) is nonpositive and that it is strictly negative 
unless A + (n) = A + '*(?i). More precisely, by (35) and Taylor's theorem with 
remainder, for v = A + (n) — A + '*(n), 

-K(n) = VG n (A+>*(n)) • v 

= G„(A+(n)) - G n (A+>») - \v ■ (V 2 G n )(A)v, 

for some A lying on the line segment between A + (n) and A + '*(ra) and 
where (V 2 G n )(-) denotes the Hessian matrix for G n {-). By the optimality 
of A + (n), G n (A + (n)) > G n (A + '*(n)), and by the strict negative definiteness 
of (V 2 G n )(A), it follows that the last line above is nonnegative and it is 
strictly positive unless v = 0. By Lemma 5.1, v = A + (n) — A +, *(n) = if and 
only if n is an invariant state. □ 

Corollary 6.1. At any regular point t for a fluid model solution n(-), 
we have 

(37) j t F(n(t)) = K(n(t))<0, 
where the inequality is strict unless n(t) 6 A4 a . 

PROOF. This follows from Lemma 6.1 and (30)-(33). □ 

For each n G R^_, let w(n) = (wj(n) :j G J*) be defined by (20). 

Lemma 6.2. For any fluid model solution n(-), t — > Wj(n(t)) is a non- 
decreasing function of t G [0, oo) for each j G J*. 
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Proof. Consider a fluid model solution n(-). Since n(-) is absolutely 
continuous, then so is the linear function w(n(-)) of n(-). From (13) and (14) 
satisfied by a fluid model solution, at a regular point t for n(-), we have for 
each j G J7"*, 

(38) ^(ra(t)) = £ A^-A^i))), 

iel+(n(i)) 

and (14) holds. Now, for j G JZ*, 

(39) C J= ^^ A , 

and on substituting this into (14) we obtain 

(40) J2 AjiKi{n{t))< A iiPi for all j G J*, 

ieZ+(n(t)) iel+(n(t)) 

and when combined with (38) this yields 

(41) iwj(n(t))>0 for all j € J"*. 

Since Wj(n(-)) is obtained by integrating its almost everywhere defined 
derivative, it follows that Wj(n(-)) is a nondecreasing function for each 

jeX. □ 

For each w G M+* , define F_(w) to be the optimal value attained in the 
optimization problem (22) and recall, from Section 5, the definition of A(w) 
as the optimizing value of n. 

Lemma 6.3. The functions F : R 3 + * — > R + and A : R J + * — > are contin- 
uous. In addition, F_ is a nondecreasing function, that is, if w and w are 
two vectors in ]R^* such that w <w, then F_(w) < F_{w). 

Proof. To prove the continuity of the functions F_ and A, consider a 
sequence {w m ,m = 1,2,...} in R 3 ^ converging to some w G By the 
growth property of F that F{n) — > oo as ||n|| — > oo, and the full row rank 
and nonnegativity assumptions on A, {A(w m ),m = 1, 2, . . . } is bounded. Let 
n be any cluster point of this sequence. Then, n satisfies the constraints of 
the optimization problem (22) and, by the continuity of F, F(A(w mk )) — > 
F(h) as k — > oo for some subsequence {m^} of {m}. By the feasibility of n, 
F(h) > F(A(w)). We claim that n = A(w). 

For a proof by contradiction, suppose that n / A(w). Then, by the strict 
convexity of F, e = F(h) — F(A(w)) > 0, and by the continuity of F, there 
is 5 > such that F{n) < F(A(w)) + e = F(h) for all n G R\ satisfying 
\\n- A(w)\\ <8. 
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Since w m — > w as m — > oo and A has full row rank, there is h G Wl such 
that ||n — A(u;)|| < 5 and A*M _1 fj > tu m for all m sufficiently large. [Here 
A* is the J* x I matrix of rank J* obtained from A by eliminating those 
rows of A that are not indexed by J* and M = diag(/i).] To see this, let A* 
be a I x J* matrix that is a right inverse for A*M _1 , that is, A^,M^ 1 A Jf = I, 
where I denotes the J* x J* identity matrix. By the definition of A(w), 
A^M -1 A{w) > w and so, since w m — ► w as m — > oo, there is m(5) sufficiently 
large that for all m > m(6), 

A„M~ 1 A(w) >w r , 



2VJ* 

where 1 denotes the J* -dimensional vector whose components are all l's. 
Let u = 27J=ptjj A]l - Note that II M H - i and A * M ~ lu = vJTjjlty" 1 - Define 
h = A(w) + u, where U{ = \u{\ for Then, h G R+, ||n — A(io)|| = ||u|| = 

||m|| < 5 and, since A*M _1 has nonnegative entries, 

A*M~ x h > A*M _1 A(u>) + ^M" 1 ^ > w m 

for all m>m(5), as desired. 

For m > m(5), n is feasible for the optimization problem (22) with w m 
in place of w, and since A(w m ) is optimal for this problem, it follows that 
F(h) > F(A(w m )). Hence, F{n) > lim fe F(A(u/ m J) = F(n). But this yields 
a contradiction, since < F(n) as ||n— A(io)|| < 5. Thus, n = A(w), and 
since n was an arbitrary cluster point of {A(w m ),m = 1,2,...}, it follows 
that this sequence converges to A(u>), and 

F(w) = F(A(w)) = limF(A(u/ m )) = limF(w m ). 

This implies the continuity of A and F. 

The nondecreasing property of F follows from the fact that, for w and w 
in satisfying w <w, any feasible solution for the problem (22) with w 
in place of w is feasible for the original problem with w. □ 

We have the following characterization of the optimal solutions of (22). 

Lemma 6.4. For each w G a vector n G is the unique optimal 
solution of (22) if and only if there is p G K+* such that for each i G T, 



(42) m = Pi 

and for each j G J* , 
(43) 



1/q 
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and 

(44) E^V>^- 

Proof. Fix w G R+* . For n G M} + and p G R+* , let 



(45) L(n,p) = F(n) + X) Pj ™J ~~ X -4j'» — I ■ 

Suppose that n G is the unique optimal solution of (22). Since A has full 
row rank, no row of A contains all zeros. Recall that A only contains zeros 
and ones. It follows that there is v G such that J2ieT ^jiJT > w j f° r eacn 
j G J*. Thus, Slater's constraint qualification (cf. [8], page 236) is satisfied 
and by the necessity theory of Lagrange multipliers (cf. [15], Theorem 1, 
page 217), there is p G Ri* such that n minimizes L(-,p) over R+ and (43) 
holds for all j £ Now, L(-,p) is continuously differentiable and so for 
each i G X we have 

8L 

where the inequality is an equality when rtj > 0. If rii > 0, this yields (42) 
immediately. If rij = 0, this yields 



(47) = m > Pi 



^j£j*PjAji\ ^ 



Since p has nonnegative components, it follows that the above holds with 
equality. Thus, (42) holds for all % GX. Inequality (44) holds for each j G J7*, 
since n is feasible for (22). 

Conversely, suppose that n G and p G satisfy (42)-(44). Then 

BL 

(48) ( n , p )=o far all t GX, 

and it follows from the strict convexity of F that n is a global minimum 
for L(-,p) over R|j_. By (44), n is feasible for (22) and for any other feasible 
h G R+ we have 

(49) F(n)=L(n,p)<L(n,p)<F(n), 

where we have used (43), the feasibility of n and the fact that p G R+* . Thus, 
n is optimal for (22). □ 



We can now prove Theorem 5.3. 
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Proof of Theorem 5.3. By Theorem 5.1, n G Iq_ is an invariant state 
if and only if there is p = q G B^.* such that (42) holds for all i 6l. Note 
that if w = w(n), then (43) and (44) automatically hold for all j G J*. Thus, 
on combining the above with Lemma 6.4, we see that n G is an invariant 
state if and only if n is the unique optimal of (22) with w = w(n). The latter 
is equivalent to n = A(w(n)). □ 

For the proof of the convergence result, Theorem 5.2, we introduce the 
following function H and prove some of its elementary properties. For each 
hGMJ., let 

(50) H(n)=F(n)-F(w(n)). 

Lemma 6.5. The function H:M^_—>'R is continuous. Furthermore, it 
is zero on the set of invariant states, M a , and it is strictly positive on 
R\\M a . 

Proof. The continuity of H on follows from the facts that F is 
clearly continuous, F_ is continuous by Lemma 6.3 and the linear function 
w(-) is continuous. 

Since n G is feasible for (22) with w = w(n), we have F(n) > F_(w(n)) 
and hence H(n) > 0. Furthermore, H{n) = if and only if n is the optimal 
solution for (22) with w = w(n), that is, if and only if n = A(w(n)). The 
latter occurs if and only if n is an invariant state by Theorem 5.3. □ 

Lemma 6.6. For any fluid model solution n(-), t — > H(n(t)) is a nonin- 
creasing function oft€ [0, oo). 

Proof. By Corollary 6.1, F(n(t)) = F(n(0)) + / * K(n(s)) ds is a non- 
increasing function of t, and, by the combination of Lemmas 6.2 and 6.3, 
F_(w(n(t)) is a nondecreasing function of t, and so it follows that H(n(t)) = 
F(n(t)) — F_(w(n(t))) is a nonincreasing function of t. □ 

Remark 6.1. A stronger form of Lemma 6.6, which shows that H(n(t)) 
is strictly decreasing at times where n(t) ^ M. a will be developed and used 
in the proof of Theorem 5.2 given below. 

PROOF of Theorem 5.2. Fix R G (0, oo) and s > 0. Let 

F(R) = sup{F(v) : v G ||v|| < R}. 

Since F is continuous on R 1 ^, F(R) is finite. For any fluid model solution 
n(-) satisfying ||n(0)|| < R, on integrating (37) we see that F(n(t)) < F(n(0)) 
for all t > 0. The fact that F(n) — > oo as ||n|| — ► oo implies that there is a 
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closed ball B (depending on R) in R5. that is centered at the origin and 
of finite radius such that F(v) > F(R) for all v G B c , where B c denotes 
the complement of B in Mr,. Combining the above, we see that for any fluid 
model solution n(-) satisfying ||n(0)|| < R, we have n(t) G B for all t G [0, oo). 
Let 

(51) D = {veB:d(M a ,v)>e}. 

Note that D depends on R and e. By Lemma 6.5, the function H is con- 
tinuous and strictly positive on the compact set D. It follows that 5 = 
m£{H(v) : v G D} is strictly positive, and the set D = {v G B : H{v) > 5} 
contains D. Moreover, since H is zero on A4 a , D does not meet M a . 

Consider a fluid model solution n(-) satisfying ||n(0)|| < R. Let T(n) = 
inf{t > 0:n(t) ^ D}. Since n(-) remains in B for all time, it follows that if 
T(n) < oo, then n(-) exits D by violating the constraint H(n{-)) > 5. Then, 
since H{n{-)) is a nonincreasing function (by Lemma 6.6), it follows that 
n(t) ^ D for all t > T(n). Consequently, since D C D, we have d(M a ,n(t)) < 
e for all t > T(n). We now develop an upper bound on T(n). For < t < 
f(n), 

(52) H{n(t)) - H(n(0)) < F(n(t)) - F(n(0)) 

(53) = f K(n(s))ds, 

Jo 

where we have used the nondecreasing property of F_(w(n(-))) for the first 
line, and Corollary 6.1 for the second line. By Lemma 6.1, K is continuous 
and strictly negative off the manifold M a . It follows that there is Cr )£ > 
such that K is bounded above by —Cr )E on the compact set D which does 
not meet M. a . Then, the above yields 

(54) f(n) < H R /C Rt£ , 

where Hr = s\ip{H(v) : ||«|| < R} < oo, and the desired result follows. □ 

APPENDIX A: 

Properties of the bandwidth sharing policy. For each n G \ {0} and 
A+ G Mf +(n)l , recall the definition of G n (A+) from (4). 

Lemma A.l. For each n G \ {0}, there is a unique optimal solution 
A + (n) = (A.; : i G 2+(n)) of (l)-(3); each of its components is strictly positive 
and uniformly bounded by C* = max, e j Cj . 

PROOF. Fix n G M.\ \ {0}. If a G (0, 1), the objective function G n (A+) 
is continuous and strictly concave as a function of A + = (Aj :i G 2+(n)) on 
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(55) < A + : Aj > for all i G Z+(n), ^jiK < Cj for all j e J 



It follows that the maximum of the objective function is achieved on this set, 
and by the strict concavity, the maximizing point is unique. Furthermore, 
since the derivative of the objective function with respect to Aj diverges 
to +oo as Aj | for each i G I + (n) , the maximizing point must have strictly 
positive components, that is, the maximum cannot occur with one of the 
Aj , i G Z + (n) , equal to zero. 

For a G [l,oo), by convention, the objective function takes the value — oo 
whenever Aj = for some i G I+in). Thus, any optimal solution to (l)-(3) 
has all components strictly positive. In fact, if we let c = imnj^j Cj/\I\, 
then the |Z_|_ (re) [-dimensional vector A + with all components set equal to c 
satisfies the constraints in (l)-(3). By combining the fact that the feasible 
set for (l)-(3) is bounded with the fact that the objective function diverges 
to — oo as Aj — ► for any i G I + (n), we may conclude that there is 5 > 
such that for all A + = (Aj :i G X + (n)) satisfying the constraints in (l)-(3) 
and such that Aj < 5 for at least one i G 2+ (re), we have G n (A + ) < G n (A + ). 
It follows that any optimal solution of (l)-(3) must be in the compact set 



(56) I A + : 5 < Aj for all i G Z+(n), ^ < Cj for all j G J 



Since the objective function is continuous and strictly concave on this com- 
pact set, it has a unique maximum there, which is the optimal solution of 



Since for each i El, there is j G J such that A™ = 1, it follows that 
A+(n) < C* = max je: 7 Cj for all i£l + (n). □ 

In the following lemmas, A:R^_ — ► is as defined in Section 3. 
Lemma A. 2. Fix n G R+. Then, A(rn) = A(n) for all r > 0. 

Proof. Fix r > 0. Note that X+(n) = l + (rn) and so Aj(n) = Aj(rn) = 

for all i G X (n) =X\X + (n). Also, for A+ G M^ +(n)l , G rn (A+) = r Q G n (A+) 
[for all a G (0, oo)], and since the constraints in (l)-(3) do not involve n, it 
follows that the optimal solution A + (rn) for the objective function G rn (-) 
is the same as the optimal solution A + (n) for the objective function G n (-), 
and hence A + (rn) = A + (n). □ 

Lemma A. 3. For each i G I, the ith component Aj(-) of the function 
A(-) is continuous on {n G : n, L > 0}. 



iex+(n) 



iSX+(n) 



(l)-(3). 
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PROOF. We first prove continuity of A(-) at n* G Iq_ satisfying n* > 
for all i G X. In this case, X + (n*) =T and, by Lemma A.l, the optimal 
solution A(n*) to the problem (l)-(3) with n = n* has all components strictly 
positive. Given e > 0, let B £ be a nonempty open ball that is centered at 
A(n*), that has radius less than or equal to e and that is a positive distance 
from the boundary of the orthant Rj_. Let D £ denote the compact set of 
A G that satisfy the constraints of (l)-(3) with n = n* there and that 
are outside of B £ . 

We claim that there is r] > 0, (3 G (0, min* =1 n*) and <5 > 0, such that 

(57) G n * (A) < G n * (A(n*)) - r? for all AeD e 
and 

(58) G n (A)<G n .(A(n*))-»//2, 

whenever n G R+ satisfies ||n — n*|| < (3 and A G -D £ is such that Aj < 5 for 
at least one i G I. (Note that raj > for all i when \\n — n*\\ < (3.) The first 
inequality (57) can be proved by contradiction. For if (57) does not hold for 
some r\ > 0, then for each positive integer k, there is A k G D e such that 

(59) G n *(A k )>G n *(A(n*))-l/k. 

By the compactness of D £ , there is A G D £ such that {A fe } converges to A 
along some subsequence. If a G (0,1), then by the continuity of G n *(-) on 
[0, oo) 1 in this case, it follows on passing to the limit in (59) that G n *(A) > 
G n *(A(n*)), which contradicts the uniqueness of the optimum A(n*). If a G 
[l,oo) and Ag (OjOo) 1 , then the continuity of G n *(-) on (OjOo) 1 yields a 
contradiction in the same manner as for a G (0,1). Finally, if a G [1, oo) 
and A, = for some i G X, then since A^ — > Aj along some subsequence of 
/c's, either /qn*log A k — > -oo (if a = 1) or K i (n*) a (A k ) 1 ~ a /(l - a) —> -oo (if 
a > 1), along this same subsequence. The other terms (indexed by I ^ i) in 
the sum constituting G n *(A k ) are either bounded (if A; ^ 0) or go to — oo 
(if A; = 0) as k — > oo along the subsequence. Consequently, G n *(A k ) — ► — oo 
as — > oo along the subsequence, which contradicts (59). Now we show the 
result pertaining to (58). For a G (0, 1), it follows from the uniform continuity 
of G n (A) as a function of (n,A) on any compact subset of [0,0c) 1 x [(Loo) 1 
that there is (3 G (0,minj =1 n|) such that 

G n (A)<G n *(A) + ?? /2 

for all n G satisfying ||n — n*\\ < (3 and A G D £ . Inequality (58) fol- 
lows immediately from this together with (57). For a = 1 and fixed (3 G 
(0,minf =1 ?i^), if n G satisfies ||n - n*\\ < [3, 5 G (0, 1), A G D £ and i G J 
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such that Aj < 5, then we have 

(60) G? ft (A)=J>,n,logA, 

l&T 

(61) < Ki(n* -P)log6 + J2 + P) log(C* V 1), 

where C* = maxj^j Cj. Since n* — (3 > 0, by the definition of (3, the last line 
above goes to — oo as 5^0. As there are only finitely many indices i G X 
and the right member of (58) does not vary with n or A, it follows that for 
any fixed (3 G (0, min* =1 n^), there is 5 G (0, 1) such that whenever n G ML 
satisfies \\n — n*\\ < (3, AG D e and Aj < 5 for some i El, then (58) holds. 
A similar proof of (58) holds for a G (1, oo), with the exception that (61) is 
replaced with 

(62) G n {A) < Ki(n* - fi) a ^—^-. 

1 — a 

Since G n (A) is uniformly continuous as a function of (n,A) on any com- 
pact subset of (0, oo) 1 x (0, oo) 1 , there is 7 G (0, [3/2) such that for all n G 
satisfying ||n — n* \\ < 7 and A G D £ satisfying Aj > 5 for all i G X, we have 

(63) G n (A)<G n .(A) + ?? /2 
and 

(64) G n (A(n*))>G n ,(A(n*))- V /2. 

Combining (57) and (58) with (63) and (64), we see that for all n G 
satisfying ||n — n*\\ < 7 and all A G D e , we have 

(65) G n (A) < G n * (A(n*)) - r//2 < G n (A(n*)). 

It follows that for all n G satisfying ||n — n*\\ < 7, the optimal solution 
A(n) of (l)-(3) must lie within B e , and hence must be no more than distance 
e from A(n*). Hence, A(-) is continuous at n* . 

Next, consider n* G M 1 ^ \ {0} such that n* k = for at least one k. We will 
show that as a function of n, (Aj(n),i G T + (n*)) is continuous at n = n* . 
Small perturbations n of n* that only involve changes to the strictly positive 
components of n* can be handled in a similar manner to that in the first 
three paragraphs of this proof. It is perturbations for which I + (n) ^ T + (n* ) 
that require additional argument, which we give below. For fixed e > 0, 
let B £ be a nonempty open ball in R^ + ^ n that is centered at A + (n*) = 
(Aj(n*) :i G X + (n*)), that has radius less than or equal to e and that is a 
positive distance from the boundary of the orthant M^' + ^ n . Let D £ denote 
the set of A G R^ + ^ n ^ that satisfy the constraints of (l)-(3) with n = n* 
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there and that are outside of B £ . In a similar manner to that for (57), by 
the strict concavity of G n *(-) on the interior of R|^ + ^ n ^' and its behavior 
near the boundary of R|^ + ^ n , there is rj > such that 

(66) G n *(A)<G n *(A + (n*))-ri forallAeD e . 

We now need separate arguments for the cases a < l,a > 1 and a = 1. 
We first consider a G (0, 1). For a vector n G such that nj > for all 

z € 2+(n*), the vector A + (n*) G ]Rjf + ^ n can be extended, by the addition of 

zero components, to a feasible vector in M^' +( ' r ^' for the optimization problem 
(l)-(3) associated with n. Using the fact that 

(67) Ki nf 



1 — a 



is zero when Aj = and a G (0, 1), it follows that the optimal solution A + (n) 
of the optimization problem (l)-(3) satisfies 

(68) G n (A + (n))>G^(A+(n*)), 

where n is the vector in ML obtained by resetting the entries in n indexed by 
i G I+(n) \I + (n*) to zero. Let A + (n) denote the vector in R^ + ^ n " consisting 
of the components of A + (ra) indexed by i 6l + (n*). Then, 

(69) G n (A + (n)) = G h (A + (n))+ ^ 



l-Q 

i6Z-)-(n)\Z+(n*) 

Since (67) tends to zero as n« — > 0, uniformly for Aj G [0, C*], there is 71 > 
such that whenever \\n — n*\\ < 71, we have rii G (0, 00) for all i G X + (n*) and 
the sum on the right-hand side of (69) is less than 77/6. Combining this with 
(68), we see that for \\n — n*\\ < 71, 

(70) Gn(A+{n)) > Gn(A + (n*)) - r//6. 

By the uniform continuity of Gfj,(A + (n*)) = Siex + (n*) K i n f — i i_ a as a 

function of (rij:i G 2+(n*)) on compact subsets of (0, oo)' 2:+ ^ n *' ) ' , there is 
72 G (0,71) such that 

(71) \\G h (A + (n*))-G n *(A + (n*))\\ < 77/6, 
whenever ||n — n*|| <72- Combining (70) with (71), we obtain 

(72) ^(A + (n))>G„.(A + (n*))-?7/3, 

for all n G satisfying ||n — n* \\ < 72 • Finally, by the same kind of argument 
as in the second and third paragraphs of this proof [cf. (58)-(65)], but with 
A replaced by (Aj : i G 2+(n*)) and n replaced by (rij : i G Z+(n*)), using (66), 
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there is 73 G (0,72) such that for all n G satisfying ||n — n* 
A G D e , we have 



< 73 and 



(73) G„.(A+(n*))>G fi (A)+»7/2. 

Combining the above, we have that for all n G Ri. such that \\n — n*\\ < 73 
and A G £> e , 



(74) 



G n (A+(n))>G fi (A) + r 1 /6. 



It then follows that for \\n — n*\\ < 73, A + (re) is not in L> £ and it satisfies the 
constraints of the optimization problem (l)-(3) with n (or n*) in place of n, 
and so it must be in B e . This completes the proof of the desired continuity 
for the case a G (0, 1). 

For the case a G (l,oo), for ni > 0, the term (67) is negative, and is 

unbounded below as A,; [ 0. However the choice Aj = rij gives an evaluation 

. i — ™ 

KiTii/(l — a) that converges to zero as rii j 0. Since J2iex + ( 



(n*) K i n i T~r^ 



is uni- 



formly continuous as a function of ((rii, Aj) :i G X + (n*)) on compact subsets 
of ((0, 00) x (0, 00))^+^" " , there is 71 > such that whenever \\rii — n* \\ < 71 
and ||Aj — A^"(n*)|| < 71 for all i G 2+(n*), we have ni > 0, Aj > for all 
i G T+(n*) and 



(75) 



J2 Kj<^-G n .(A+(n*)) 
— 1 — a 

ieX + (n*) 



Since n* = for all i ^ X + (n*), there is 72 G (0,71) such that 

1 



(76) 

and 
(77) 



6 1 



a 



E 



K, n 7 - < 



iSX + (n)\J + (n*) 



5^ A i j?i i <(7i/2)AC j 

iSX+(n)\X+(n*) 



for all j G J , 



whenever n G RL satisfies ||n — ra*|| < 72- Then, for such n, the vector A G 

]R^ +(n)l defined by A4 = m for i G l+(n) \I+(n*) and A» = A+(n*) - 7l /2 
for i GX + (n*), satisfies the constraints of the optimization problem (l)-(3) 
for n. Since A + (n) is the optimal solution of that problem, using the same 
notation with tildes as for the case a G (0, 1), it follows, by the negativity of 
(67), that 



(78) 



Gn(A + (n))>G n (A + (n))>G n (A), 
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where, by (75) and (76), 

(79) G n (A) = — !— K t nfk\- a + -^- K ^ 

I — a J—f , 1 — a „ , "tt"' . , 

«GX + (n*) ieT + {n)\X + (n*) 

(80) >G B .( A +(rO)-2-3. 

The proof proceeds in the same manner as from (72) onward in the proof 
for a G (0, 1), to show that A + (n) is in B e whenever n is sufficiently close to 
n . 

If a = 1 and rij > 0, the term (67) becomes K^njlogAj, which may take 
positive or negative values. Nevertheless, one can proceed in a very similar 
manner to that for the case a G (1, oo), after observing that if A, = rii, then 
the evaluation Kj?ijlognj is negative for small enough rtj and approaches 
zero as rij | 0, while the maximum over < Aj < C* of the analogue of (67), 
namely KjnjlogC*, also approaches zero as rii J. 0. It follows that there is 
72 > such that for n G satisfying ||n — n*|| < 72, we have n.j > for all 
i G T+{n*) and 

(81) G fi (A + (n))>G n (A+(n))- ?? /12 

(82) > G n * (A+(n*)) - 77/12 - 77/12 - 77/6. 
The proof can then be completed as in the case a G (1, 00). □ 

Remark A.l. Aj(n) may be discontinuous at n if rij = 0. 

For n G R!l \ {0}, consider the Lagrangian 

L n (A + ,p) = G n (A + ) + Y,PAC j - £ AjiAi), 

is J' V iex+(n) / 

where G n (A+) is defined by (4), A+ = (A^z G I+(n)) G (0, oo)l x +^)l and 
P = (Pj '■ 3 ^ <7) ^ R+ is a vector of Lagrange multipliers. Since the optimal 
solution A + (n) of (l)-(3) has strictly positive components (cf. Lemma A.l), 
we can use the theory of Lagrange multipliers to characterize A + (n) as 
follows. 

Lemma A. 4. Fix n G \ {0}. A vector A + = (Aj :i G 1 + {n)) is the 
unique optimal solution of (l)-(3) if and only if there is p G such that 

(83) Pj (Cj- A H A i ) = f° r aU 3 G J, 

(84) £ pj Ajj > for all i G £f(n), 
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(85) 
and 

(86) A ii A i<Cj for all je J. 

Proof. By Lemma A.l, all of the components of the unique optimal 
solution A + (n) of (l)-(3) are strictly positive. Since Cj > for each j G 
J, there is A + G (0, oo)l x +C n )l satisfying Cj - T,iel+(n) A jiK > for each 
j G J . It follows that Slater's constraint qualification (cf. [8], page 236) 
is satisfied and so by the necessity theory of Lagrange multipliers (cf. [15], 
Theorem 1, page 217), there is p G Ri such that A + (n) maximizes L n (A + ,p) 
over A + G (0, oo)l J +MI , and (83) holds with Aj = A+(n) for all i G I+(n). 
Since A + = A + (ra) is an interior maximum for L n (-,p), which is continuously 
differentiable as a function of A + G (p,oo)> I+ ( n '\, it follows that 

(87) °=5 A ;( A+ W^) = ^(x^)J -Y,Pi A 3i foralHGJ+(n). 

The relations (84) and (85) follow immediately from this and the strict 
positivity of A + (n). Condition (86) follows from the constraints satisfied by 
A+(n). " 

Conversely, if p G and A+ = (Aj :i G Z + (n)) satisfy (83)~(86), then us- 
ing the strict concavity of L n (-,p) one can verify that (A + ,p) is a saddle point 
of L n (-, •) on (0, oo)' :r+ ( n )' x and so by a sufficiency theorem for Lagrange 
multipliers (cf. [15], Theorem 2, page 221), A + is an optimal solution of (1)- 
(3) with Aj > in place of Aj > for all i G X+(n). But since the optimal solu- 
tion of (l)-(3) has strictly positive components, it follows that A + is optimal 
for (l)-(3). 

□ 

Remark A. 2. Given n G ML \ {0}, while the vector p = p(n) may be 
non-unique, the vector (J2j£jPj( n )Aji,i G X + (n)) is unique by (85) and the 
fact that A + (n) is unique, by Lemma A.l. 

APPENDIX B: 

Fluid model solutions as fluid limits. We shall use the following repre- 
sentation for the Markov process introduced in Section 4: 

N i (t) = N i (0) + E i (t)-S i (T i (t)), i £l, i > 0, 
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where iV(0) is a random variable taking values in that is independent of 
E, S; E, S are independent I-dimensional processes with independent com- 
ponents such that for each i S I, E{ is a Poisson process with rate Vi and 
Si is a Poisson process with rate /ij. The sample paths of these processes 
are assumed to be right continuous on the time interval [0, oo) and to have 
finite left limits on (0, oo). The process T = {Ti'.i 6l) is a continuous, I- 
dimensional, nondecreasing process such that for each i, Ti(t) represents the 
cumulative amount of (bandwidth) capacity allocated to route i up to time 
t; it is given by 

(89) Ti(t)= I hi{N(s))ds foviel, t > 0, 

Jo 

where the function A(-) = (Aj(-) :i G X) is defined as in Section 3 using the 
optimization problem (1)~(3). For each j € J , let 

(90) Uj(t) = Cjt - ^2 AjiTi(t) = Cjt - (AT(t))j, t > 0. 

The process Uj is continuous and nondecreasing, and Uj(t) represents the 
cumulative unused capacity for resource j up to time t. 

We develop a fluid model, which can be thought of as a formal law of 
large numbers approximation for the stochastic model. For this, we consider 
a sequence of stochastic systems, indexed by r — > oo, in which the initial 
conditions may change with r, but E and S are kept fixed. The processes 
A^ r , jjr ^ j, r ^ <^ - n r ^ S y S tem are rescaled with law of large numbers 
scaling to yield new processes N ,U ,T ,E ,S : 



(91) N r (t) =N r (rt)/r, 

(92) U r (t) = U r (rt)/r, 

(93) T r (t)=T r (rt)/r, 

(94) E r {t) =E(rt)/r, 

(95) S r {t) =S(rt)/r 



for each t > 0. 

The sample paths of (T ,E ,S ,N ,U ) lie in the space £) 4I+J of func- 
tions defined from [0, oo) into M 4I+J that are right continuous on [0, oo) and 
have finite limits from the left on (0,oo). This space is endowed with the 
usual Skorokhod J\ -topology (cf. [10]). We say that a subsequence (which 
may be the whole sequence) of {(T ,E ,S ,N ,U )} is tight (resp. con- 
verges weakly) if the probability measures induced on D AI+3 by the subse- 
quence are tight (resp. converge weakly). We say that the subsequence is 
C-tight if it is tight and any weak limit point has continuous paths almost 
surely. 
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To obtain the fluid model, we formally pass to the limit as r — > oo in the 
rescaled versions of (88)-(90). In fact, we have the following theorem. For 
this, recall the definition of a regular point given before Definition 5.1. 

Theorem B.l. Suppose that {iV r (0)} converges in distribution as r — > 
oo to a random variable taking values in RL ■ Then, the sequence {(T ,E ,S ,N 
is C-tight, and any weak limit point (T, E, S, N, U) of this sequence ( obtained 
by taking a weak limit as r — > oo along a suitable subsequence), almost surely 
satisfies the following for all t > 0: E(t) = vt, S(t) = fit, 

(96) N i (t) = N i (0) + v i t-f H T i (t)>0, iel, 

(97) U j (t)=C j t-(AT(t)) j >0, jej, 

where U is nondecreasing and continuous, T is uniformly Lipschitz contin- 
uous with a Lipschitz constant bounded by C* = maxjgj- Cj , and T(0) = 0. 
Moreover, for a.e. sample point uj, T(-,lo) is absolutely continuous and for 
each regular point t for T(-,u), for each i £l, 

(98) ±T iM = \ A ^^ V e l + SM1' 
V ' dt n ' ; I A, ifi£l (N(t,uj)), 

and (13) and (14) hold at t for each i £l and j £ J , with n(-) = N(-,uj) 
there. 



Proof. For each value of r and t > 0, 

(99) T r (t) = - I"' K{N r {s))ds= C A(N r (s))ds, 

r Jo Jo 

where we have used the scaling property of Lemma A. 2. Hence, since A(-) 
is uniformly bounded by C*, for each r, t — > T r (t) is uniformly Lipschitz 
continuous with a Lipschitz constant bounded by C* . It follows immediately 
that the sequence of processes {T (•)} is C-tight. On combining this with the 
assumed convergence in distribution of {N (0)}, the functional weak law of 
large numbers for (E,S), (88) and (90), we_se_e that_{(T r ,E r , S r , N r ,TJ r )} 
is C-tight, and any weak limit point (T,E,S,N,U) of this sequence has 
the following properties: almost surely for all t > 0, E(t) = vt, S(t) = fit 
and (96) and (97) hold, where U is nondecreasing and continuous, and T is 
uniformly Lipschitz continuous with a Lipschitz constant bounded by C* , 
and T(0) = 0. 

To prove the remaining properties claimed in the theorem, without loss 
of generality, using the Skorokhod representation theorem and the fact that 
the sample paths of T, E, S, N, U are continuous with probability one, we 
may assume that all stochastic processes under consideration are defined on 
the same probability space and that almost surely 

(100) {(T r ,E r ,S r ,N r ,U r )} -» (T,E,S,N,U) u.o.c. 
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as r — > oo through a sequence {rfcj^Lp Here u.o.c. denotes uniformly on 
compact time intervals. 

Fix a sample point ui such that the convergence in (100) holds for uj. Then, 
T(-,uj) is absolutely continuous and differentiable at almost every time. Fix 
a regular point t for T(-,uj). This is the same as a regular point for N(-,u>), 
and U(-,uj) is differentiable there. For i ETo(N(t,uj)), Ni(t,uj) = and since 
Ni(-,co) > 0, we must have -^Ni(t,u) = 0, and hence, by (96), -^Ti(t,u>) = 
Pi = Vi/ni. For i G I + (N(t,uj)), Ni(t,uj) > and by the continuity of N(-,u), 
there is e > (depending on t and uj) such that Ni(s,u) > for all s£ + 
e]. By (100), N k (-,uj) — ► N(-,uj) uniformly on [t,i + e] as & — > oo, and it then 
follows from the continuity property of Lemma A. 3 that Ai(N rk (-,u>)) — > 
Ai(N(-,uj)) uniformly on [t,t + e] as k — > oo. Hence, by passing to the limit 
in (99) we obtain 

(101) Ti(s,uj) -Ti(t,u) = J Ai(N(u,uj))du for all s G [i, t + e]. 

It follows, that 4;Ti(t,uj) = Ai(N(t, uj)), since Aj(-) is continuous on {n G 
> 0}, by Lemma A. 3. Combining the above, we see that (98) holds 
at a regular point t for T(-,uj). Moreover, for each j G J, since Uj(-,uj) is a 
nondecreasing function, we have 

(102) 0<±U j (t,u) = C j - (A^T(t,u)j . 

Combining (98) and (102) with (96), we see that n(-) = N(-,uj) satisfies 
(13) and (14) for each i G I and j G J , at each regular point t for N(-,lo) 
[which is the same as a regular point for T(-,uj)]. □ 

Remark B.l. Theorem B.l shows that there exists a fluid model solu- 
tion for each possible starting point n(0) G R5_. Note, however, that we are 
not assuming uniqueness of fluid model solutions given the starting state. 
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